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ABSTRACT 

Using detailed mock galaxy redshift surveys we investigate to what extent the kine- 
matics of large samples of satellite galaxies extracted from flux-limited surveys can 
be used to constrain halo masses. Unlike previous studies, which focussed only on 
satellites around relatively isolated host galaxies, we try to recover the average ve- 
locity dispersion of satellite galaxies in all haloes, as a function of the luminosity of 
the host galaxy. We show that previous host-satellite selection criteria yield relatively 
large fractions of interlopers and with a velocity distribution that, contrary to what 
has been assumed in the past, differs strongly from uniform. We show that with an 
iterative, adaptive selection criterion one can obtain large samples of hosts and satel- 
lites, with strongly reduced interloper fractions, that allow an accurate measurement 
of Csat(ihost) over two and a half orders of magnitude in host luminosity. We use the 
conditional luminosity function (CLF) to make predictions, and show that satellite 
weighting, which occurs naturally when stacking many host-satellite pairs to increase 
signal-to-noise, introduces a bias towards higher cr sat (Lhost) compared to the true, 
host-averaged mean. A further bias, in the same direction, is introduced when us- 
ing flux-limited, rather than volume-limited surveys. We apply our adaptive selection 
criterion to the 2dFGRS and obtain a sample of 12569 satellite galaxies and 8132 
host galaxies. We show that the kinematics of these satellite galaxies are in excellent 
agreement with the predictions based on the CLF, after taking account of the various 
biases. We thus conclude that there is independent dynamical evidence to support the 
mass-to-light ratios predicted by the CLF formalism. 

Key words: galaxies: halos — galaxies: kinematics and dynamics — galaxies: fun- 
damental parameters — galaxies: structure — dark matter — methods: statistical 



1 INTRODUCTION 

Ever since the "discovery" of dark matter astronomers have 
attempted to obtain accurate measurements of the masses 
of the extended dark matter haloes in which galaxies are 
thought to reside. A detailed knowledge of halo masses 
around individual galaxies holds important clues to the 
physics of galaxy formation, and is an essential ingredient 
of any successful model that aims at linking the observable 
Universe (i.e., galaxies) to the bedrock of our theoretical 
framework (i.e. dark matter). 

The main challenge in measuring total halo masses is 
to find a suitable, visible tracer at sufficiently large radii in 
the halo potential well. Traditionally, starting with the ac- 
tual discovery of evidence for dark matter by Zwicky (1933, 
1937), astronomers have used the kinematics of satellite 
galaxies. Since the number of detectable satellites in indi- 
vidual systems is generally small, this technique is basically 
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limited to clusters of galaxies (e.g., Carlberg et al. 1996; 
Carlberg, Yee & Ellingson 1997b) and the local group (e.g., 
Little & Tremaine 1987; Lin, Jones & Klemola 1995; Evans 
& Wilkinson 2000; Evans et al. 2000). However, one can 
stack the data on many host-satellite pairs to obtain statis- 
tical estimates of halo masses. Pioneering efforts in this di- 
rection were made by Erickson, Gottesman & Hunter (1987), 
Zaritsky et al. (1993, 1997), and Zaritsky & White (1994). 
Although these studies were typically limited to samples of 
less than 100 satellites, they nevertheless sufficed to demon- 
strate the existence of extended massive dark haloes around 
(spiral) galaxies. 

More recently, large, homogeneous galaxy surveys, such 
as the Sloan Digital Sky Survey (SDSS; York et al. 2000) 
and the Two Degree Field Galaxy Redshift Survey (2dF- 
GRS; Colless et al. 2001) have dramatically improved both 
the quantity and quality of data on (nearby) galaxies, thus 
allowing the construction of much larger samples of host- 
satellite pairs (McKay et al. 2002; Prada et al. 2003; Brain- 
erd & Specian 2003). Yet, each of these studies has been 



© 2004 RAS 



2 van den Bosch, Norberg, Mo & Yang 



extremely conservative in their selection of "isolated" hosts 
and "tracer" satellites. For example, McKay et al. (2002), 
Prada et al. (2003) and Brainerd & Specian (2003) used 
samples with 1225, 2734, and 2340 satellites, respectively. 
For comparison, the SDSS and 2dFGRS, from which these 
samples were selected, contain well in excess of one hundred 
thousand galaxies. The main reason for being so conservative 
is to prevent too large numbers of interlopers (i.e., satellites 
that are not bound to their host galaxies, but only appear 
associated because of projection effects). In addition, the se- 
lection criteria are optimized to only select isolated galaxies, 
with the motivation that the dynamics of binary systems, 
for example, are more complicated. 

In this paper we use a different approach and investi- 
gate to what extent the kinematics of satellite galaxies may 
be used to estimate the mean mass-to-light ratio, averaged 
over all possible dark matter haloes. We make the simple 
ansatz that satellite galaxies are in virial equilibrium within 
their dark matter potential well. This is motivated by the 
finding that dark matter subhaloes, which are likely to be 
associated with satellite galaxies, have indeed been found to 
be in a steady-state equilibrium (Diemand, Moore & Stadel 
2004). Although we acknowledge that not all systems will be 
fully virialized, we hypothesize that with a sufficiently large 
sample of hosts and satellites the assumption of virial equi- 
librium is sufficiently accurate to describe the mean proper- 
ties. We use detailed mock galaxy redshift surveys (hereafter 
MGRSs) to optimize the selection criteria for host and satel- 
lite galaxies and show that an iterative, adaptive selection 
criterion is ideal to limit the number of interlopers while 
still yielding large numbers of hosts and satellites. Applying 
our selection criteria to the 2dFGRS yields 8132 hosts with 
12569 satellites. We show that the kinematics of these satel- 
lite galaxies are in good agreement with predictions based 
on the conditional luminosity function introduced by Yang, 
Mo & van den Bosch (2003a) and van den Bosch, Yang & 
Mo (2003a). 

This paper is organized as follows. In Section 2 we 
first describe, in detail, the construction of our MGRSs. In 
Section 3 we use these MGRSs to compare three different 
selection criteria for host and satellite galaxies. We inves- 
tigate the impact of interlopers and non-central hosts on 
the satellite kinematics, and show how stacking data from 
flux-limited surveys results in a systematic overestimate of 
the true average velocity dispersion of satellite galaxies. We 
show how the CLF can be used to take these biases into 
account. In Section 4 we apply our selection criteria to the 
2dFGRS and compare the results to analytical estimates and 
to our MGRS. We summarize our findings in Section 5. 

Throughout we assume a flat ACDM cosmology with 
Q. m = 0.3, tt A = 0.7, h = H o /(W0 kms^Mpc" 1 ) = 0.7 
and with initial density fluctuations described by a scale- 
invariant power spectrum with normalization erg = 0.9. 



2 MOCK GALAXY REDSHIFT SURVEYS 

What is the best way to select host and satellite galaxies 
from redshift surveys such as the 2dFGRS and the SDSS? 
How does the flux-limited nature of these surveys impact 
on the results? How do interlopers bias the mass estimates? 
In order to address these and other questions we use de- 



tailed mock galaxy redshift surveys. These have the ad- 
vantage that (i) we know exactly the input mass-to-light 
ratios that we aim to recover, (ii) we can mimic realistic 
host /select ion criteria and investigate the impact of inter- 
lopers. In addition, MGRSs allow a detailed investigation of 
the effect of Malmquist bias, various survey incompleteness 
effects, boundary effects due to the survey geometry, etc. 

To construct MGRSs two ingredients are required; a 
distribution of dark matter haloes and a description of how 
galaxies of different luminosity occupy haloes of different 
mass. For the former we use large numerical simulations (see 
Section 2.1 below), and for the latter the conditional lumi- 
nosity function <&(L\M)dL. The conditional luminosity func- 
tion (hereafter CLF) was introduced by Yang et al. (2003a) 
and van den Bosch et al. (2003a) as a statistical tool to 
link galaxies to their dark matter haloes, and describes the 
average number of galaxies with luminosity L ± dL/2 that 
reside in a halo of mass M. As shown in these papers, the 
CLF is well constrained by the 2dFGRS luminosity function 
of Madgwick et al. (2002) and the correlation lengths as a 
function of luminosity obtained by Norberg et al. (2002a). 
Details about the CLF used in this paper can be found in 
Appendix A. 



2.1 Numerical Simulations 

The distribution of dark matter haloes is obtained from a 
set of large A-body simulations (dark matter only) for a 
ACDM 'concordance' cosmology with £7 m = 0.3, 57a = 0.7, 
h — 0.7 and as = 0.9. The set consists of a total of six 
simulations with N — 512 3 particles each, and is described 
in more detail in Jing (2002) and Jing & Suto (2002). All 
simulations consider boxes with periodic boundary condi- 
tions; in two cases Lbox = lOO/i -1 Mpc while the other four 
simulations all have Lbox = 300ft -1 Mpc. Different simu- 
lations with the same box size are completely independent 
realizations and are used to estimate uncertainties due to 
cosmic variance. The particle masses are 6.2 x 10 8 /i _1 Mg 
and 1.7 x 10 10 ft _1 MQ for the small and large box simula- 
tions, respectively. In what follows we refer to simulations 
with Lbox = 100ft -1 Mpc and Lbox = 300ft -1 Mpc as Lioo 
and L300 simulations, respectively. 

Dark matter haloes are identified using the standard 
friends-of-friends algorithm (Davis et al. 1985) with a linking 
length of 0.2 times the mean inter-particle separation. For 
each individual simulation we construct a catalogue of haloes 
with 10 particles or more, for which we store the mass, the 
position of the most bound particle, and the halo's mean 
velocity and velocity dispersion. Haloes that are unbound 
are removed from the sample. In Yang et al. (2003b) we 
have shown that the resulting halo mass functions are in 
excellent agreement with the analytical halo mass function 
given by Sheth, Mo & Tormen (2001a) and Sheth & Tormen 
(2002). 

2.2 Halo Occupation Numbers 

Because of the mass resolution of the simulations and be- 
cause of the completeness limit of the 2dFGRS we adopt 
a minimum galaxy luminosity of L m in = 3 x 10 7 ft -2 L 
throughout. The mean number of galaxies with L > L m i n 
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that resides in a halo of mass M follows from the CLF ac- 
cording to: 



(N), 



$(L|M) AL. 



(1) 



In order to Monte-Carlo sample occupation numbers for in- 
dividual haloes one requires the full probability distribution 
P(N\M) (with N an integer) of which (N)m gives the mean, 



is the brightest galaxy in each halo, and we draw its lumi- 
nosity, L c , constrained. It therefore has an expectation value 
of 



(L c ) f 



-r 



<&(L\M) LdL 



(5) 



The remaining N — 1 satellite galaxies are assigned lumi- 
nosities in the range L m in < L < L\ drawn at random from 
the distribution function $(L|M). 



{N) M = ^ AP(iV|M) 



(2) 



We use the results of Kravtsov et al. (2003), who has 
shown that the number of subhaloes follows a Poisson dis- 
tribution. In what follows we differentiate between satellite 
galaxies, which we associate with these dark matter sub- 
haloes, and central galaxies, which we associate with the 
host halo (cf. Vale & Ostriker 2004). The total number of 
galaxies per halo is the sum of iV C en, the number of central 
galaxies which is either one or zero, and jV sat , the (unlimited) 
number of satellite galaxies. We assume that N Bat follows a 
Poissonian distribution and require that A sat = whenever 
A con — 0. The halo occupation distribution is thus specified 
as follows: if {N)m < 1 then iV S at = and iV ce n is either zero 
(with probability P = 1 — (N}m) or one (with probability 
P = (N) M ). If (N) M > 1 then A con = 1 and iV sat follows 
the Poisson distribution 



P(iV sat |M) = e" 



i A 4 



iVsat! 



(3) 



with fj, = (iV S at)M = {N)m — 1. As discussed in Kravtsov 
ct al. (2003) the resulting P(N\M) is significantly sub- 
Poissonian for haloes with small {N)m (i.e., low mass 
haloes), but approaches a Poissonian distribution for haloes 
with large {N)m- Such P(N\M) is supported by both semi- 
analytical models and hydrodynamical simulations of struc- 
ture formation (Seljak 2000; Benson et al. 2000; Scoccimarro 
et al. 2001; Berlind et al. 2003), and has been shown to yield 
correlation functions in better agreement with observations 
than, for example, for a pure Poissonian P{N\M) (Benson 
et al. 2000; Berlind & Weinberg 2002; Yang et al. 2003a). 



2.3 Assigning galaxies their luminosity and type 

Since the CLF only gives the average number of galaxies 
with luminosities in the range L ± dL/2 in a halo of mass 
M, there are many different ways in which one can assign 
luminosities to the N galaxies of halo i and yet be con- 
sistent with the CLF. The simplest approach would be to 
simply draw Ni luminosities (with L > L m i n ) from <E>(L|Af). 
We refer to this luminosity sampling as 'random'. Alterna- 
tively, one could use a more constrained approach, and, for 
instance, always demand that the j th brightest galaxy has a 
luminosity in the range [Lj,Lj-{\. Here Lj is defined such 
that a halo has on average j galaxies with L > Lj, i.e., 



J L, 



$(L\M)dL 



(4) 



We refer to this luminosity sampling as 'constrained'. 

We follow Yang et al. (2003b) and adopt an intermedi- 
ate approach. Throughout we assume that the central galaxy 



2.4 Assigning galaxies their phase-space 
coordinates 

Next the mock galaxies need to be assigned a position and 
velocity within their halo. We assume that each dark mat- 
ter halo has an NFW density distribution (Navarro, Frenk 
& White 1997) with virial radius r v i r , characteristic scale 
radius r s , and concentration c = r v i T /r s . Throughout this 
paper we compute halo concentrations as a function of halo 
mass using the relation given by Eke, Navarro & Steinmetz 
(2001), properly accounting for our definition of halo mass^. 
The "central" (brightest) galaxy in each halo is assumed to 
be located at the halo centre, which we associate with the 
position of the most bound particle. Satellite galaxies are as- 
sumed to follow a radial number density distribution given 
by 



n sat (r) oc 



(— 

\TZr, 



(6) 



(limited to r < r v i r ) with a and 7Z two free parameters. 
Unless specifically stated otherwise we adopt a = TZ = 1 for 
which the number density distribution of satellite galaxies 
exactly follows the dark matter mass distribution. 

Finally, peculiar velocities are assigned as follows. We 
assume that the "central" galaxy is located at rest with re- 
spect to its halo, and set its peculiar velocity equal to the 
mean halo velocity. Satellite galaxies are assumed to be in 
a steady-state equilibrium within the dark matter potential 
well with an isotropic distribution of velocities with respect 
to the halo centre. As shown by Diemand et al. (2004) this 
is a good approximation for dark matter subhaloes, and we 
assume it also applies to satellite galaxies. One dimensional 
velocities are drawn from a Gaussian 



27rcr sa t (r) 



exp 



2<r s 2 at (r) 



(7) 



with Vj the velocity relative to that of the central galaxy 
along axis j, and (J sat (r) the local, one-dimensional velocity 
dispersion obtained from solving the Jeans equation 



t(r) = 



n sa t(r )-Q^(r)dr 



(8) 



with ^t(r) the gravitational potential (Binney & Tremaine 
1987). Substituting eq. (6) for the spatial number density 
distribution of satellites yields 



t(r) = 



cVi 



(- 



t Throughout this paper halo masses, denoted by M, are defined 
as the mass inside the radius Rigo inside of which the average 
overdensity is 180. 
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^ r 



/j,i(x)dx 



with 



H a {x) = 



y 



/rg {x/n)^{i + x/nY— 



(9) 



(10) 



Table 1. Selection Criteria 



For a = TZ = 1 the satellite galaxies follow the same density 
distribution as the dark matter (i.e., no spatial bias), and (9) 
reduces to the radial velocity dispersion profile of a spherical 
NFW potential 



x 3 (l + x) 2 



2 i x c\/ 

(cf. Klypin et al. 1999). 

2.5 Creating Mock Surveys 

We aim to construct MGRSs with the same selection criteria 
and observational biases as in the 2dFGRS, out to a maxi- 
mum redshift of z max = 0.15. We follow Yang et al. (2003b) 
and stack 4x4x4 identical L300 boxes (which have periodic 
boundary conditions), and place the virtual observer in the 
centre. The central 2x2x2 boxes, are replaced by a stack 
of 6 x 6 x 6 L100 boxes (see Fig. 11 in Yang et al. 2003b). 
This stacking geometry circumvents possible incompleteness 
problems in the mock survey due to insufficient mass reso- 
lution of the Z/300 simulations, and easily allows us to reach 
£max = 0.15 in all directions. We mimic the various obser- 
vational selection and completeness effects in the 2dFGRS 
using the following steps: 

(i) We define a (a, (5)-coordinate frame with respect to the 
virtual observer at the centre of the stack of boxes, and re- 
move all galaxies that are not located in the areas equivalent 
to the NGP and SGP regions of the 2dFGRS. 

(ii) For each galaxy we compute the redshift as 'seen' by 
the virtual observer. We take the observational velocity un- 
certainties into account by adding a random velocity drawn 
from a Gaussian distribution with dispersion 85km s _1 (Col- 
less et al. 2001), and remove those galaxies with z > 0.15. 

(iii) For each galaxy we compute the apparent magni- 
tude according to its luminosity and distance, to which 
we add a rms error of 0.15 mag (Colless et al. 2001; Nor- 
berg et al. 2002b). Galaxies are then selected according to 
the position-dependent magnitude limit, obtained using the 
apparent magnitude limit masks provided by the 2dFGRS 
team. 

(iv) To take account of the completeness level of the 2dF- 
GRS parent catalogue (Norberg et al. 2002b) we randomly 
remove 9% of all galaxies. 

(v) To take account of the position- and magnitude- 
dependent completeness of the 2dFGRS, we randomly sam- 
ple each galaxy using the completeness masks provided by 
the 2dFGRS team. 

Each MGRS thus constructed contains, on average, 
144000 galaxies, with a dispersion of ~ 2600 due to cosmic 
variance. As we show in Section 4 this is in perfect agree- 
ment with the 2dFGRS. In addition, we verified that the 
MGRSs also accurately match the clustering properties (see 
Yang et al. 2003b), the apparent magnitude distribution and 
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Column (1) indicates the ID of the selection criterion, the pa- 
rameters of which are listed in Columns (2) to (7) and described 
in the text. (T200 is defined as the velocity dispersion of satellite 
galaxies around the host galaxy of interest in units of 200 km s -1 . 

the redshift distribution of the 2dFGRS. Thus, overall our 
MGRSs are fair representations of the 2dFGRS. 



3 METHODOLOGY 
3.1 Selection Criteria 

A galaxy is considered a potential host galaxy if it is at least 
fh times brighter than any other galaxy within a volume 
specified by R p < R h and |AV| < (AV) h - Here R p is the 
separation projected on the sky at the distance of the can- 
didate host, and AV is the line-of-sight velocity difference. 
Around each potential host galaxy, satellite galaxies are de- 
fined as those galaxies that are at least f s times fainter than 
their host and located within a volume with R p < R s and 
\AV\ < (AV)s- Host galaxies with zero satellite galaxies are 
removed from the list of hosts. 

In total, the selection of hosts and satellites thus de- 
pends on six free parameters: Rh, (AV)/, and fh to spec- 
ify the population of host galaxies, and R s , (AV) a and 
f s to specify the satellite galaxies. These parameters also 
determine the number of interlopers (defined as a galaxy 
not physically associated with the halo of the host galaxy) 
and non-central hosts (defined as a host galaxy that is not 
the brightest, central galaxy in its own halo). Minimizing 
the number of interlopers requires sufficiently small R s and 
(AV)s- Minimizing the number of non-central hosts requires 
one to choose Rh, (AV)h and fh sufficiently large. Of course, 
each of these restrictions dramatically reduces the number 
of both hosts and satellites, making the statistical estimates 
more and more noisy. 

We compare three different selection criteria (hereafter 
SC), that only differ in their values for the six parameters 
described above (see Table 1), and use our MGRS to inves- 
tigate the resulting fractions of interlopers and non-central 
hosts. Most results are summarized in Table 2 and Fig. 1. 
SC 1 is identical to that used by McKay et al. (2002) and 
Brainerd & Specian (2003), who investigated the kinemat- 
ics of satellite galaxies in the SDSS and 2dFGRS, respec- 
tively. The same SC was also used by Prada et al. (2003) 
for their sample 3. It uses a fairly restrictive set of parame- 
ters; host galaxies must be at least two times more luminous 
(fh = 2) than any other galaxy within a volume specified by 
R h = 2k' 1 Mpc and (AV) h = 1000 kms -1 . Satellite galax- 
ies are selected as those galaxies within R a — 0.5/i _1 Mpc 
and [AV) S = 1000 km s -1 around each host that are at least 
four times fainter (f s = 4.0) than the host. Applying these 
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Figure 1. Various statistical properties of host /satellite pairs obtained from our MGRS using three different selection criteria (SC). 
Panels on the left plot a histogram of the distribution of AV = V a at — V^ ost . The contributions of interlopers and satellites around non- 
central hosts are indicated by hatched and cross-hatched histograms, respectively. Note that the P(AV) of interlopers is not uniform, 
as typically assumed. The open circles in the panels in the middle column indicate the interloper fraction /; nt for a number of bins in 
^host- Horizontal errorbars indicate the range of Lhost used. Dashed lines indicate the interloper fractions obtained by fitting P(AV) 
with a Gaussian plus a constant (see Section 3.3), which systematically underpredicts the true interloper fraction. Solid lines indicate 
the fraction of satellites around non-central host galaxies. Panels on the right plot the fraction of satellites in each bin of L^ ost . Note 
how the number of satellites decreases rapidly with decreasing Lhosti an d that this decrease is less pronounced with SC 3. 



SC to our MGRS (which consists of a total of 143727 galax- 
ies), yields 1851 hosts and 3876 satellites. The fraction of 
interlopers is 27 percent, while only one percent of the hosts 
is non-central (see Table 2). 

The open circles in the upper, middle panel of Fig. 1 
show that the interloper fraction, /i nt , is larger around lower- 
luminosity hosts, reaching as high as ~ 60 percent around 



the faintest hosts in the sample. Clearly, accurate estimates 
for halo masses based on the kinematics of satellite galaxies 
requires a proper correction for these interlopers, and thus a 
detailed knowledge of their velocity distribution, Pi nt (AV). 
Thus far, the standard approach has been to assume that 
Pint (AF) is uniform (i.e., McKay et al. 2002; Brainerd & 
Specian 2003; Prada et al. 2003). The upper left-hand panel 
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of Fig. 1 plots P(AV) for host-satellite pairs selected from 
our MGRS using SC 1. The hatched histogram shows the 
contribution of interlopers. Clearly, Pmt(AV) is not uniform, 
but instead reveals a pronounced peak around AV = 0. This 
is due to (i) the fact that galaxies, including interlopers, are 
clustered, and (ii) the infall of galaxies around overdense 
regions. As we show below, assuming a uniform Pi nt (AV) 
in the analysis of satellite kinematics (as is generally done) 
results in a systematic underestimate of /i n t but, fortunately, 
does not lead to significant errors in the kinematics. 

With SC 2 our main objective is to increase the num- 
ber of hosts and satellites, and to include galaxy groups and 
clusters on a similar footing as 'isolated' galaxies. In SC 2 we 
therefore set both fa and f 3 to unity. This greatly increases 
the number of both hosts and satellites, and allows brightest 
cluster galaxies to be included as host galaxies. In order to 
cover a sufficiently large volume in velocity space to prop- 
erly sample rich clusters we enlarge both (AV)j, and (AV) S 
to 2000 kms -1 . As we show below, this has the additional 
advantage that it allows a better determination of the con- 
tribution of interlopers, and therefore a more accurate, sta- 
tistical correction. SC 2 results in 4 to 5 times as many hosts 
(AW = 7863) and satellites (AT sat = 19099) as with SC 1. 
However, the fraction of interlopers has also increased, from 
27 to 39 percent. Fortunately, as is evident from the middle- 
left panel of Fig. 1, most of these excess interlopers have 
1000 kms" 1 < \ AV\ < 2000 kms -1 and are easily corrected 
for (see Section 3.3). As for SC 1, the fraction of interlopers 
increases strongly with decreasing I/host , with more than 70 
percent interlopers around the faintest hosts in the sample. 
The fraction of non-central hosts, / nc , has increased from 
one to five percent compared to SC 1. This is mainly a con- 
sequence of setting fh = 1, which allows bright galaxies in 
the outskirts of clusters to be (erroneously) selected as hosts. 
As evident from the cross-hatched histogram in the middle- 
left panel of Fig. 1, the P(AV) of satellite galaxies around 
these non-central hosts is very broad. As for the interlopers, 
accurate estimates of halo masses based on satellite kine- 
matics require keeping the number of non-central hosts as 
small as possible. 

Both selection criteria discussed so far yield fairly high 
fractions of interlopers, especially around faint hosts. In ad- 
dition, as is evident from the panels on the right-hand side 
of Fig. 1, the number of host-satellite pairs decreases very 
rapidly for I/host ,$ 3 x 10 10 h~ 2 Lq. This is mainly due to the 
limited number of faint hosts that make the selection crite- 
ria. Ideally, one would use adaptive selection criteria, adjust- 
ing (AV)h, Rh and R 3 to the virial radius and virial velocity 
of the halo of the host galaxy in question. This, however, re- 
quires prior knowledge of the halo masses as a function of 
^host, which is exactly what we are trying to recover from 
the satellite kinematics. We therefore use an iterative proce- 
dure: start from an initial guess for o- sa t(L h ost) and estimate 
the corresponding virial radius and velocity around each in- 
dividual host galaxy. Use these to adapt (AV)h, Rh, and 
R s to the host galaxy in question, and select a new sample 
of host-satellite pairs. Use the new sample to obtain an im- 
proved estimate of cr sat (I/host), and start the next iteration. 
In detail we proceed as follows: 



(i) Use SC 2 to select hosts and satellites 



(ii) Fit the satellite kinematics of the resulting sample 
with a simple functional form (see Section 3.3). 

(iii) Select new hosts and satellites using (AV)h = 
1000a2oo kms -1 , (AV) S = 2000 kms" 1 , R h = 
0.8<T2ooft _1 Mpc, and R s = 0.15<72ooft -1 Mpc. Here 0200 is 
o" sa t(ihost) in units of 200 kms -1 . 

(iv) Repeat (ii) and (iii) until a S sut{L hoBt ) has converged 
to the required precision. Typically this requires 3 to 4 iter- 
ations. 

The numerical values in step (iii) are based on extensive tests 
with our MGRSs, optimizing the results using the following 
criteria: large A^at and Ahost, small /i n t and / nc , and good 
sampling of I/host- The Rh and R s correspond roughly to 
2.0 and 0.375 times the virial radius, respectively. Applying 
this adaptive SC to our MGRS yields 10483 hosts and 16750 
satellites (after 4 iterations). The number of host galaxies 
has drastically increased with respect to SC 2. As we show 
below, this allows us to probe the satellite kinematics down 
to host galaxies with much fainter luminosities. The number 
of satellite galaxies, on the other hand, has decreased with 
respect to SC 2. This mainly reflects a drastic decrease in 
the number of interlopers, from 39 to 15 percent. More im- 
portantly, the interloper fraction no longer strongly depends 
on Lhost- 



3.2 Analytical Estimates 

When investigating the impact of interlopers and non- 
central hosts on the kinematics of satellite galaxies, and 
comparing different selection criteria, it is useful to have an 
analytical estimate of the expected cr sat (L). This section de- 
scribes how the conditional luminosity function (CLF) may 
be used to compute (a sat (L)) for a flux-limited survey such 
as the 2dFGRS. 

For a halo of mass M, the expectation value for the 
projected velocity dispersion of satellite galaxies is given by 



(<7s a t )m = 



47T 



(A^sat)/ 



n sat (r) <7 Bat (r) r dr 



(12) 



with (A r sat )M the mean number of satellites with L > L n 
in a halo of mass M which is given by 



(ATsat}' 



-f. 



$(L|M) dL- 1 



47r / n sat (r)r 2 dr 



Substituting (6) and (9) yields 

V vir r~^~ f /TC y 2 - Q/2 T 1/2 (y) 



(cr sa t); 



with 



li a (c/TZ) V TZm(c) J Q (1 + 2/) 



U3-c)/2 



jl \ [°° fJ,l(R.T)dT 

W J y ra+ 2 (1 + r) 3-a 



(13) 



dy(U) 



(15) 



In general, there will not be a one-to-one, purely de- 
terministic, relation between halo mass and host luminosity. 
Therefore, when averaging over all host galaxies of given lu- 
minosity, the expectation value for the velocity dispersion of 
their satellite galaxies is given by 
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Table 2. Satellite Kinematics. 



Survey 
(1) 


SC 
(2) 


A/total 

(3) 


A/host 

(4) 


AT sa t 

(5) 


fnc 

(6) 


/int 

(7) 


ffit 
•'int 

(8) 


CIO 

(9) 


ai 
(10) 


0-2 

(11) 


MGRS 


1 


143727 


1851 


3876 


0.01 


0.27 


0.20 


130 


0.55 




MGRS 


2 


143727 


7863 


19099 


0.05 


0.39 


0.28 


185 


0.37 


0.17 


MGRS 


3 


143727 


10483 


16750 


0.07 


0.15 


0.12 


176 


0.52 


0.11 


2dFGRS 


3 


146735 


8132 


12569 






0.19 


193 


0.48 


0.13 



Column (1) indicates whether the sample of host and satellite galaxies has been extracted from our MGRS or from the 2dFGRS. 
Column (2) indicates the selection criterion used, the parameters of which are listed in Table 1. Columns (3) indicates the total number 
of galaxies in the survey, while columns (4) and (5) indicate the numbers of host and satellite galaxies, respectively. Columns (6) and (7) 
indicate the fraction / nc of non-central hosts and the (true) fraction of interlopers /i n t, both of which arc only known for the MGRS. 
Finally, columns (8) to (11) list the best-fit parameters obtained from the maximum likelihood method described in Section 3.3. 



(<Tsat(-I<c)) 



P{M\L C ) (a sa .t) M AM 



(16) 



with P(M\L C )AM the conditional probability that a central 
galaxy with luminosity L c resides in a halo of mass M ± 
AM/2, and (cr sat )M is the expectation value for the projected 
velocity dispersion of satellites in a halo of mass M given by 
cq. (14). Using Bayes theorem, we rewrite (16) as 



((Tsat(ic)) = 



J °° P(L C \M) n(M) (a sa t)M AM 
/ °° P(L c \M)n(M) AM 



(17) 



with n(M) the halo mass function and P(L C \M) the con- 
ditional probability that a halo of mass M hosts a central 
galaxy with luminosity L c . In our MGRS, L c is drawn 'con- 
strained' for which 



P(L C \M) 



$(L C \M) 




if L c > Li(M) 
if L c < Li(M) 



(18) 



with L±(M) defined by eq. (4). In Appendix B we show how 
P(L C \M) can be obtained in the case where L c is drawn 
'randomly' (as opposed to 'constrained') from the CLF. The 
dashed curve in Fig 2 plots the host-averaged (cx sa t(L c )) thus 
obtained (see Appendix A for details regarding the CLF 
used). This (er sa t(L c )) is the true mean velocity dispersion 
of satellite galaxies around hosts with luminosity L c , where 
the mean is taken over the number of host galaxies. 

Unfortunately, this is not what an observer who stacks 
many host-satellite pairs together obtains, which instead is 
a satellite-weighted mean. Since more massive haloes typ- 
ically contain more satellites, satellite weighting will bias 
(cr S at (L c )) high with respect to the host-weighted mean. We 
can use the CLF to estimate the magnitude of this 'bias'. 
The satellite- weighted expectation value for er sat (L c ), in a 
volume-limited survey complete down to a limiting luminos- 
ity of L min is 



(cr sa t(L c )) = 



J °° P(M\L C ) <JV S at)M <CT sat >M AM 

/~P(M|L c )<JV M t>Af dAf 



(19) 



The dotted line in Fig. 2 shows the (er sa t(L c )) thus obtained. 
For Lhost ^ 3x 10 9 h~ 2 L the satellite- weighted mean is, as 
expected, larger than the host-weighted mean (by as much 
as ~ 40 percent). For less luminous hosts, however, satellite 
averaging has no significant effect. This is due to the detailed 
functional form of (A/ Bat ) M , which is much shallower at low 
M than at high M (e.g., van den Bosch et al. 2003a). 

The above estimate is based on a complete sampling 



of the satellite population of each halo. In reality, how- 
ever, there are two effects that result in a reduced complete- 
ness. First of all, satellites are only selected within a certain 
projected radius around the host. Whenever that radius is 
smaller than the virial radius, only a fraction of all satellites 
enter the sample, and with a mean velocity dispersion that 
differs from (14). Secondly, in a flux-limited survey the num- 
ber of satellites around a host of given luminosity depends 
on redshift. Again, we can account for these two effects using 
simple algebra. 

Suppose we observe a host-satellite system in projection 
through a circular aperture with radius R p . The expectation 
value for the observed velocity dispersion of the satellites in 
a halo of mass M is given by 



CARRf™ 



^=n sa t(r)a sat (r) 



CARRf™ 



n sat (r) 



(20) 



which is straightforward to compute upon substituting (6) 
and (9). For R p — r vir one obtains the total projected ve- 
locity given by (14). 

The expectation value for a flux-limited survey follows 
from integrating (17) over redshift: 



1 r& pz m nx 
(ffsat(Lc)) = yj An J A 



AV 

z d?TdI^ t(ic ' 2)) (21) 



Here Q. is the solid angle of sky of the survey, AV is the dif- 
ferential volume element, and Zmax is the minimum of the 
survey redshift limit (0.15 in our MGRSs) and the maxi- 
mum redshift out to which a galaxy with luminosity L c can 
be detected given the apparent magnitude limit of the sur- 
vey. The expectation value {cr S a.t(L c , z)) follows from (17) 
upon replacing n(M) with n(M,z) (i.e., accounting for the 

evolution in the halo mass function) t. In addition, since the 
minimum luminosity of a galaxy in a flux-limited survey de- 
pends on redshift the L m in in (13) needs to be replaced with 
Lmin(z). Finally, in case R p < r v h- one needs to use (af at )M 
given by (20) rather than (14). 

The solid curve in Fig. 2 shows the {a aa ,t(Lc)) thus ob- 
tained with R p = r v ; r . Overall, the expectation value for 



t Note that we assume here that the CLF docs not evolve with 
redshift, at least not over the small range of redshift (z < 0.15) 
considered here. 
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Figure 2. Expectation values for <r sat as a function of Z/host com- 
puted using the CLF as explained in the text. The dashed line cor- 
responds to the true, host-averaged mean. The dotted line shows 
the satellite-weighted mean, which is what an observer obtains 
from stacking many host-satellite pairs (using a volume limited 
survey) . The solid and dot-dashed lines show the expectation val- 
ues for a flux-limited survey, computed using 'constrained' and 
'random' host luminosities, respectively. See text for details. 



o"sat(ic) of a flux-limited survey is larger than for a volume- 
limited survey. This owes to the fact that, because of the flux 
limit, smaller mass haloes loose a relatively larger fraction 
of satellites. Finally, the dot-dashed line shows the same ex- 
pectation value, satellite-averaged over a flux-limited survey, 
but computed assuming 'random', rather than 'constrained', 
luminosities for the central galaxies. This increases the width 
of the conditional probability distribution P(M\L C ) (see Ap- 
pendix B), which in turn strongly increases (<T sat (L c )). 

Clearly, satellite weighting from flux-limited surveys in- 
troduces large systematic biases in the kinematics of satel- 
lite galaxies. The magnitude of this bias depends on, among 
others, host luminosity and the second moment of P(M\L C ), 
and can easily be as large as a factor two. Since, to first or- 
der, M oc of at , this implies a systematic overestimate of halo 
masses of almost an order of magnitude. Clearly, if one were 
not to correct for these systematic biases, the mass-to-light 
ratios inferred from satellite kinematics are systematically 
too high by the same amount. Unfortunately, such bias cor- 
rection is model-dependent. Although it is straightforward 
to correct for the bias due to the satellite averaging in a vol- 
ume limited survey (by simply weighting each satellite by the 
inverse of the number of satellites around the corresponding 
host galaxy) , in a flux- limited survey one has to correct for 
missing satellites (those that did not make the flux- limit). 
This requires prior knowledge of the abundances and lumi- 
nosities of satellite galaxies, and is thus model dependent. 
As shown here, the CLF formalism is one such model that 
can be used to model these biases in a straightforward way. 



3.2.1 The connection between spatial bias and velocity 
bias 

The expectation values discussed so far are based on the 
assumption that satellite galaxies follow a NFW number 
density distribution, i.e., we assumed a constant mass-to- 
number density ratio. Numerous studies have shown this to 
be a good approximation for clusters of galaxies (e.g., Carl- 
berg et al. 1997a; van der Marel et al. 2000; Lin, Mohr & 
Stanford 2004; Rhines et al. 2004; Diemand et al. 2004). 
However, numerical simulation have shown that dark mat- 
ter subhaloes typically are spatially anti-biased with respect 
to the mass distribution (i.e., Moore et al. 1998, Ghigna et 
al. 1999; Colin et al. 1999; Klypin et al. 1999; Okamoto & 
Habbe 1999; Springel et al. 2001; De Lucia et al. 2004; Die- 
mand et al. 2004). If satellite galaxies follow a similar anti- 
bias this will reflect itself on the dynamics. We can model 
spatial anti-bias by adjusting the free parameters a and TZ. 
Setting a — 0, for example, introduces a constant number 
density core, and thus spatial anti-bias at small radii. The 
parameter TZ gives the ratio of the scale radii of satellite 
galaxies and dark matter, and can be used to adjust the 
radius of the core region. 

Fig. 3 illustrates the effect that spatial bias has on the 
dynamics of satellite galaxies. We define the local velocity 
bias as 



Mr) 



ofatO) 



1/2 



(22) 



with of at (r) and cr^ m (r) the one-dimensional, isotropic veloc- 
ity dispersions of satellites and dark matter particles, given 
by eq. (9) and (11), respectively. If b v > 1 then satellite 
galaxies typically move faster than dark matter particles in 
the same halo, and we speak of positive velocity bias. If 
b v < 1 the satellites are dynamically colder than the dark 
matter particles, and the velocity bias is said to be negative. 
In addition to the local velocity bias b v (r), we also define the 
global velocity bias 



1/2 



(23) 



where (•) indicates mass or number-averaged quantities (cf. 
eq. [12]). The left-hand panel of Fig. 3 plots, for a halo with 
M = 10 hT M©, the local velocity bias b v (r) for three 
different populations of satellite galaxies with a = and 
TZ = 1, 2, and 3 as indicated. Note the overall positive ve- 
locity bias, which reaches very high values at small radii: a 
(dynamically relaxed) tracer population that is less centrally 
concentrated than the mass distribution has a positive ve- 
locity bias (see also Diemand et al. 2004). The middle panel 
of Fig. 3 shows that the global velocity bias (b v ) is larger for 
more massive haloes (since these have smaller halo concen- 
trations c). The right-hand panel, finally, plots the expecta- 
tion values (<T sat ) as a function of Lhost- In the case of a — 
and TZ = 3 the expectation values are a factor 1.1 to 1.2 
larger than for the case without spatial bias (a = TZ = 1). 
Thus, the effect of spatial (anti)-bias is fairly small; it is 
much more important to have accurate knowledge of the 
conditional probability function P(M\L C ) than of n sa t(r) if 
one is to obtain accurate, unbiased estimates of (a sat ) as a 
function of Lhost- 
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Figure 3. The left-hand panel plots the local velocity bias b v (r) of isotropic, steady-state populations of satellite galaxies that are 
spatially anti-biased with respect to the mass distribution. Results arc shown as a function of radius (normalized to the halo virial 
radius) for three different spatial distributions, as indicated. For comparison, the solid line corresponds to the case without spatial bias. 
Note how spatial anti-bias induces a positive velocity bias. The panel in the middle shows the corresponding global velocity bias, (b v ), 
as a function of halo mass. More massive haloes reveal a larger, positive velocity bias. Finally, the right-hand panel plots corresponding 
expectation values for o" sat (L^ost ) • The larger the spatial anti-bias, the higher the expectation values, although the effect is marginal. 



3.2.2 The impact of orbital anisotropies 

So far we have assumed that the orbits of satellite galax- 
ies are isotropic. However, numerical simulations indi- 
cate that the orbits of dark matter subhaloes, although 
close to isotropic near the centre, become slightly radi- 
ally anisotropic at larger halo-centric radii (e.g. Diemand et 
al. 2004). In this section we estimate how anisotropy impacts 
on the projected velocity dispersion of satellite galaxies. 

The expectation value for the projected velocity disper- 
sion of satellite galaxies is given by eq. (12). As long as this 
expectation value integrates over all satellites, it is almost 
independent of the anisotropy of the orbits. This is most eas- 
ily seen by considering the virial theorem, which states that 
for a virialized system (ct 2 )m = \W\/M, with W the sys- 
tem's total potential energy. Thus, as long as (a) ~ {a 2 } 1 '" 2 , 
which is a good approximation for most realistic systems, 
the projected velocity dispersion of satellite galaxies should 
be virtually independent of the anisotropy of their orbits. 

Contrary to the global value of {o~ 2 ), the local velocity 
dispersion depends quite strongly on anisotropy. Therefore, 
as soon as one only considers a radially dependent fraction 
of the satellites, as is the case with our selection criterion 
3 where we integrate over a circular aperture with radius 
_R ap ~ 0.375iivir, the impact of anisotropy is no longer nec- 
essarily negligible. In order to estimate the amplitude of this 
effect we first solve the Jeans equation in spherical symmetry 



dr 



- j. 2 P 
po T H pa 



d* „ 



(24) 



„V , is a mea- 



(Binney & Tremaine 1987). Here f3(r) = 1- 
sure of the orbital anisotropy, and a r (r) and a t (r) are the 
velocity dispersions in the radial and tangential directions, 
respectively. Assuming a constant value for (3, and using the 
boundary condition p(r)a 2 (r) — > for r — > oo, the solution 
to this linear differential equation of first order is 



'■ p(r) M(r) dr 



(25) 



closed within radius r. The expectation value for the pro- 
jected velocity dispersion integrated over a circular aperture 
with radius _R ap is given by 



<* 2 >(/?) = 



fMR/r) 2 



■dr 



=dr 



(26) 

Ir 

y/l-(R/r)* 

where we have made use of eq [4-60] in Binney & Tremaine 
(1987). To compute the impact of orbital anisotropy we sub- 
stitute eq. (25) in the above expression and use an NFW 
density distribution to compute the ratio 

jl/2 



w 2 m 



(a 2 (Q)) 



(27) 



with G the gravitational constant and M(r) the mass en- 



where we set i? ap = 0.375i? v ir as appropriate for our selec- 
tion criterion 3. We find that £ increases from 0.97 to 1.05 
when /3 increases from —0.5 to 0.5. Clearly, even when using 
an aperture that is only about one-third of the virial radius, 
orbital anisotropy effects the projected velocity dispersion 
of satellite galaxies only at the level of a few percent. 



3.3 Kinematics 

The expectation values discussed above are all based on 
an idealized situation without interlopers and non-central 
hosts. We now turn to our MGRSs from which we select 
hosts and satellites using the selection criteria discussed in 
Section 3.1. We analyze the kinematics of the satellite galax- 
ies and compare the results to the analytical predictions 
presented above. This allows us to investigate the impact of 
interlopers and non-central hosts, and to properly compare 
the different selection criteria. 

The upper panels of Fig 4 show scatter plots of AV = 
Vsat — Vhost as a function of Lhost , which represents the 'raw 
data' on satellite kinematics that we seek to quantify. We 
follow McKay et al. (2002) and Brainerd & Specian (2003) 
and proceed as follows. We bin all host-satellite pairs in a 
number of bins of Lhost and fit the distribution P(AV) for 
each of these bins with the sum of a Gaussian (to represent 
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the true satellites) plus a constant (to represent the interlop- 
ers). The final estimate of the projected velocity dispersion 
of satellite galaxies, cr sa t, follows from the velocity disper- 
sion of the best-fit Gaussian after correcting for the error 
in AV. In our attempt to mimic the 2dFGRS, we added 
a Gaussian error of 85 kms" (see Colless et al. 2001) to 
the velocity of each galaxy in our MGRS. The error on AV 
is therefore equal to \p2 x 85 kms -1 = 120 kms -1 , which 
we subtract from cr sat in quadrature. The solid circles with 
errorbars in the lower panels of Fig. 4 plot the a" sa t (Lhost) 
obtained for the three samples extracted from our MGRS. 
Vertical errorbars are obtained from the covariance matrix of 
the Levenberg-Marquardt method used to fit the Gaussian- 
plus-constant to P(AV), and horizontal errorbars indicate 
the range of luminosities of the host galaxies in each bin. 
The dotted, horizontal line indicates the 'resolution limit' of 
120 kms -1 . 

The constant term in the fitting function has typi- 
cally been interpreted as representing the contribution due 
to interlopers (McKay et al. 2002; Brainerd & Specian 
2003). However, we have shown above that Pi nt (AV) dif- 
fers strongly from a uniform distribution. The /int(Lh os t) 
that follows from integrating the constant term is shown as 
a dashed line in the panels in the middle column of Fig. 1. 
As expected, these interloper fractions are systematically 
too low compared to the true interloper fractions (open cir- 
cles). Unfortunately, with real data the detailed Pi nt (AV) 
is unknown, making it difficult to properly correct for the 
interlopers. We therefore devised a strategy that aims at 
tuning the selection criteria to limit the number of inter- 
lopers to acceptable levels (cf. SC 3). In order to remove 
interlopers with large \AV\ we still take the constant term 
into account in the fitting function. As long as the fraction 
of interlopers is sufficiently small, the remaining interlopers 
should not strongly effect the kinematics. 

In addition to discrete measurements of a sat for sev- 
eral independent bins in Lhost, we also use a method that 
fits cr sa t(ihost) to all host-satellite pairs simultaneously. We 
parameterize the relation between a sat and Lhost with a 
quadratic form in the logarithm: 



logcr sat = logcrio + ailogLio + a 2 (logLi ) 2 



(28) 



Here L w = Lh O st/10 10 fr -2 L and a w = £7 sa t(Lio). Let / int 
denote the fraction of interlopers, and assume that /int is 
independent of Lhost and/or AV. Then, the probability that 
a satellite around a host with luminosity Lhost has a velocity 
difference with respect to the host of AV kms -1 is given by 



P{AV) = 



2{AV) S 



(l-/int) 

H exp 



AV 



(29) 



Here a e s = \J of at + 120 2 defines the effective dispersion 
that takes account of the velocity errors, and 



zu — \p2m cr e ff erf 



\/2<j cB 



(30) 



so that (29) is properly normalized to unity over the valid 
range \AV\ < (AV) S . We use Powell's directional set 
method to find the parameters (ffio, ai, 02, /int) that max- 
imize the likelihood C = '^2 i \n[P(AVi)] where the sum- 
mation is over all satellites. It is this maximum likelihood 
method that we use to parameterize the satellite kinemat- 



ics in our adaptive selection criterion (SC 3) introduced in 
Section 3.1. Columns (8)-(ll) of Table 2 list the best-fit val- 
ues for /i n t (to be compared to the true interloper fraction 
listed in column 7), aio, ai and a 2 obtained from fitting the 
host-satellite pairs extracted from the MGRS. The dashed 
lines in the lower panels of Fig. 4 indicate the corresponding 
best-fit o- sa t (Lhost). 

With SC 1 the satellite kinematics can only be measured 
accurately over about a factor 5 in luminosity (cf. Brain- 
erd & Specian 2003). Therefore, when fitting <7 sa t(Lhost) us- 
ing the maximum likelihood method we keep 02 fixed at 
zero, such that eq. (28) reduces to a simple power-law. As 
can be seen, the maximum likelihood method and the dis- 
crete Gaussian-plus-constant fits yield satellite velocity dis- 
persions in good agreement with each other. Because fh and 
f s are not equal to unity with SC 1, it is difficult to compute 
expectation values for a sa t (Lhost) based on the CLF. In order 
to assess the impact of interlopers and non-central hosts we 
therefore compute the velocity dispersion of the true satellite 
galaxies directly from the MGRS: open circles in the upper- 
left panel of Fig. 4 correspond to J2i=i( A V) 2 ~ 12 ° 2 
where the summation is over all true satellites (excluding in- 
terlopers and satellites around non-central hosts) . The best- 
fit <j sa t(L), both from the maximum likelihood method as 
well as from the discrete Gaussian-plus-constant fits, are in 
good agreement with these true values, indicating that the 
incomplete correction for interlopers does not significantly 
influence the satellite kinematics. 

In the case of SC 2, the luminosity range over which 
accurate measurements of <r sat (L) can be obtained has in- 
creased to almost one and a half orders of magnitude. The 
o"sat (L) is in reasonable agreement with the expectation val- 
ues computed using eq. (21) with R p — r vir (thick solid line), 
except for the lowest luminosity bins. This is due to the large 
fraction of (excess) interlopers and to the presence of satel- 
lite galaxies around non-central hosts. Especially the latter 
can cause a significant overestimate of the true o" sa t (L) . Nev- 
ertheless, despite an interloper-fraction of 39 percent, SC 2 
allows one to recover the expected a sat (L) with reasonable 
accuracy. 

The results for SC 3 are even more promising. Because 
of the large number of faint host galaxies, a sa t (Lhost) can 
be measured over two and a half orders of magnitude down 
to ~ 4 x 10 8 h~ 2 Lq. The cr sa t obtained are in good agree- 
ment with the expectation values (solid line, computed us- 
ing eq. (21) with R p — 0.375r v i r ) , even in the regime where 
(Js a t < 120 kms -1 . The dashed line indicates the best-fit 
o" sa t (Lhost) obtained from the maximum-likelihood method, 
the corresponding parameters of which are listed in Table 2. 

Fitting eq. (28) directly to the expected o" sa t(Lhost) 
(solid line in the lower right-hand panel of Fig. 4) yields 
as best-fit parameters: aio = 178 kms -1 , a\ = 0.58, and 
(22 = 0.10. Thus, the expectation is somewhat steeper than 
the actual best-fit relation (for which ai = 0.52). In or- 
der to address the significance of this difference we pro- 
ceed as follows: we construct Monte-Carlo samples based on 
aio = 176kms -1 , ai — 0.52, 02 = 0.11, and / n t = 0.12, cor- 
responding to the best- fit values obtained for the MGRS. 
For each of the 16841 satellite galaxies in the MGRS we 
randomly draw a AV from equation (29), using / n t and 
o" sa t (Lhost) given by eq. (28). To this AV we add a Gaus- 



© 2004 RAS, MNRAS 000, 000-000 



Probing Dark Matter Haloes with Satellite Kinematics 11 



1000 - 



s 



> 
< 



-1000 - 



E 



v 



00 

o 



2.5 - 



2 - 



1.5 - 



| I I I I | I I I I | I I I I | I I I I | I I I | I I I 1 J - 1 I TT-p 1 I:' - 1 I :| I l l T1 | I I I | TT I I | III I | I I I I J I I 1 I |" 



I I I | I I I I | I I I I | I I I I | I I I I | 



MGRS 
SC 1 




I I I | I I I I | I I I I | I I I I | I I 




I I I | I I I I | I I I I | I I I I | I I I 1/ | - 



).5 10 10.5 11 
log[L hogl ] (h"%) 



.5 10 10.5 11 



Figure 4. Kinematics of satellite galaxies in the MGRS. Upper panels show scatter plots of AV versus Lh os t for the three different 
host-satellite samples discussed in the text. Lower panels plot the corresponding satellite velocity dispersions. Solid dots with errorbars 
indicate the o" sa t(£host) obtained by fitting P(AV) with the sum of a Gaussian and a constant term. Vertical errorbars are obtained 
from the covariance matrix of the Lcvcnbcrg-Marquardt method used for the fitting, and horizontal errorbars indicate the range of host 
luminosity used. The dashed lines indicate the best-fit cr sat (Lhost) obtained using the maximum-likelihood method. Open circles in the 
lower left-hand panel, and solid lines in the lower middle and right-hand panels, indicate the expectation values. Any difference between 
the cr S at (-Lhost ) and these expectation values is due to interlopers, satellites around non-central hosts, and shot noise (see text for detailed 
discussion). The dot-dashed line indicates the host-averaged mean, and is shown for comparison. The dotted, horizontal line indicates 
c sa t = 120 kms^ 1 which corresponds to the 'resolution limit' (due to the errors on AV). 



sian deviate with standard deviation of 120 km s -1 to mimic 
the velocity errors. This yields a sample of (AV^, Lh os t) with 
the same distributions of Lhost and jV sa t as for the MGRS. 
Next we apply the maximum likelihood method and find 
the best-fit values of (<tio, Oi, a 2 , /int). The distributions of 
these best-fit values, obtained from 1000 of these Monte- 
Carlo samples, are shown in Fig. 5. Clearly, the maximum- 
likelihood method accurately recovers the input values of 
ctio, oi, ci2, and /i nt (indicated by vertical and horizontal 
lines) with la errorbars of 3 kms~\ 0.02, 0.04 and 0.005, 
respectively (and with the errors on a\ and d2 somewhat cor- 
related). Given these random errors, the difference between 
the dashed and solid lines is marginally significant, reflect- 
ing the effect of interlopers and (mainly) satellites around 
non-central hosts. Nevertheless, the difference is sufficiently 
small, that we conclude that it is possible to use large, flux 
limited redshift surveys to obtain accurate estimates of the 
velocity dispersion of satellite galaxies in haloes that span a 
wide range of masses. However, keep in mind that these are 
biased with respect to the host-averaged means. 



4 RESULTS FOR THE 2dFGRS 

We now focus on real data. We use the final, public data re- 
lease from the 2dFGRS, restricting ourselves only to galaxies 
with redshifts 0.01 < z < 0.15 in the North Galactic Pole 
(NGP) and South Galactic Pole (SGP) subsamples with a 
redshift quality parameter q > 3. This leaves a grand total of 
146735 galaxies with a typical rms redshift error of 85km s _1 
(Colless et al. 2001). Absolute magnitudes for galaxies in the 
2dFGRS are computed using the K-corrections of Madgwick 
et al. (2002). 



4.1 Satellite Kinematics 

Applying our adaptive selection criterion (SC 3) to the 2dF- 
GRS yields 8132 host galaxies and 12569 satellite galaxies. 
The first thing to notice is that these numbers are quite a 
bit smaller than for the MGRS. In order to check whether 
this is consistent with cosmic variance, we constructed four 
independent MGRSs, using different L100 and L300 simula- 
tion boxes. Applying SC 3 to each of these MGRSs yields 
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Figure 5. The upper panels plot distributions of /j nt , criO) a l> an< i a 2 (as indicated) obtained from 1000 Monte-Carlo simulations. These 
are used to estimate errors on these parameters obtained from fitting the distribution of (AV, Lhost) in the MGRS (see text). Thick, 
vertical lines indicate the input values. The lower panels show two-dimensional scatter plots of these parameters. Horizontal and vertical 
lines again indicate the input values. 



AW = 10600 ± 300 and AT sat = 16300 ± 550. Clearly, the 
deficit of host and satellite galaxies in the 2dFGRS compared 
to our MGRS is very significant. We address the abundances 
of host and satellite galaxies in much more detail in a forth- 
coming paper (van den Bosch et al. , in preparation). For 
the moment, however, we ignore this discrepancy and focus 
on the kinematics only. 

Results are shown in Fig. 6. Solid dots with error- 
bars indicate the o" sat (ihost ) obtained by fitting a Gaussian- 
plus-constant. The thick dashed line indicates the best-fit 
o" sa t (Lhost) obtained using the maximum likelihood method 
(with the best-fit parameters listed in Table 2). The gray 
area indicates the expectation values obtained from our CLF 
where the upper and lower boundaries correspond to 'ran- 
dom' and 'constrained' luminosity sampling, respectively. 
Clearly, the satellite kinematics obtained from the 2dFGRS 
are in excellent agreement with these predictions. For com- 
parison, the solid line indicates the expectation value ob- 
tained using host-averaging (see discussion in Section 3.2). 
As already discussed, the flux limit of the 2dFGRS and the 
satellite-averaging results in a significant bias of the mea- 
sured cr aa ,t(ihost) compared to this host-averaged mean. 

The expectation values (gray area) are computed as- 
suming no spatial bias between satellite galaxies and the 
dark matter mass distribution. As we have shown in Sec- 
tion 3.2.1, the effect of spatial bias, and the resulting velocity 
bias, is small compared to the uncertainties resulting from 
the luminosity sampling. In other words, deviations of the 
true n sa ,t{r) from the NFW distribution assumed here will 
not significantly affect our main conclusion that the satellite 
kinematics of the 2dFGRS are in excellent agreement with 
predictions based on our CLF. 



5 SUMMARY 

Previous attempts to measure the kinematics of satellite 
galaxies have mainly focussed on isolated spiral galaxies. 
Using detailed mock galaxy redshift surveys we investigated 
to what extent a similar analysis can be extended to include 
a much wider variety of systems, from isolated galaxies to 
massive groups and clusters. Our method is based on the as- 
sumption that satellite galaxies are an isotropic, steady-state 
tracer population orbiting within spherical NFW dark mat- 
ter haloes, and that the brightest galaxy in each halo resides 
at rest at the halo centre. These assumptions are, at least 
partially, supported by both observations of cluster galaxies 
(i.e., van der Marel et al. 2000) and by numerical simula- 
tions of dark matter subhaloes (see Diemand et al. 2004 and 
references therein). Although there are most definitely sys- 
tems in which one or more of these assumptions break down, 
we hypothesized that with sufficiently large samples of host- 
satellite pairs the occasional 'perturbed', non-relaxed, sys- 
tem will not significantly influence the results. In addition, 
we have shown that the assumption of orbital isotropy only 
influences the velocity dispersion of satellite galaxies at the 
few percent level. 

We used the conditional luminosity function (CLF) for- 
malism to make predictions of the observed velocity disper- 
sion of satellite galaxies around host galaxies of different 
luminosity. We showed that the satellite weighting, which 
occurs naturally when stacking many host-satellite pairs to 
increase signal-to-noise, introduces a bias towards higher 
o"sat(£host) compared to the true, host-averaged mean. A 
further bias, in the same direction, is introduced when using 
flux-limited, rather than volume-limited surveys. Finally, we 
demonstrated that most of the uncertainty in interpreting 
the measured cr aa t(Lhost) owes to the unknown second mo- 
ment of the conditional probability distribution P(Af|Lh os t) : 
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Figure 6. Solid dots with errorbars indicate the best-fit 
Csat(ihost) obtained from the 2dFGRS data. The dashed line 
indicates the corresponding best-fit function of the form (28) ob- 
tained using the maximum likelihood method described in the 
text. The gray area indicates the expectation values obtained us- 
ing the CLF, where the upper and lower boundary correspond 
to 'random' and 'constrained' luminosity sampling, respectively. 
The solid line indicates the corresponding host-averaged mean 
Csat(ihost) an( i i s shown for comparison. 



typically a larger second moment yields higher expectation 
values. The CLF formalism is the ideal tool to properly take 
all these various biases into account. 

An important, additional problem with the interpreta- 
tion of satellite kinematics is how to deal with the presence of 
interlopers (i.e., galaxies selected as satellites but which are 
not physically associated with the halo of the host galaxy) 
and non-central hosts (i.e., galaxies that are selected as host 
galaxies, but which are not the central, brightest galaxy in 
their own halo). The abundances of interlopers and non- 
central hosts depend strongly on the selection criteria. Using 
MGRSs, constructed from the CLF, we investigated differ- 
ent selection criteria for hosts and satellites. The first one 
is identical to the criteria used by McKay et al. (2002) and 
Brainerd & Specian (2003) in their analyses of satellite kine- 
matics in the SDSS and 2dFGRS, respectively. Applied to 
our MGRS it yields only 3876 satellites, which only allows 
an analysis of satellite kinematics around host galaxies that 
span a factor 5 in luminosity. Although the fraction of non- 
central hosts is, with one percent, negligible, the fraction of 
interlopers is 27 percent. More importantly, the interloper 
fraction increases strongly with decreasing host-luminosity, 
reaching values as high as 60 percent around the faintest 
host galaxies. Contrary to what has been assumed in the 
past (e.g., McKay et al. 2002; Brainerd & Specian 2003), 
the velocity distribution of interlopers is strongly peaked 
towards AV = 0, similar to the distribution of true satellite 
galaxies. This makes a detailed correction for interlopers ex- 
tremely difficult. In fact, the method used thus far, based on 



the assumption of a uniform P(AV), typically underpredicts 
the true interloper fraction by ~ 50 percent. 

We therefore devised an iterative, adaptive selection cri- 
terion that yields large numbers of hosts and satellites with 
as few interlopers and non-central hosts as possible. Apply- 
ing these criteria to our MGRS yields 16750 satellites around 
10483 hosts. In addition, the interloper fraction is only 14 
percent and, more importantly, does not vary significantly 
with host luminosity. Because of the much larger numbers 
involved, and the reduced fraction of interlopers, satellite 
kinematics can be analyzed over two and a half orders of 
magnitude in Lhost- The resulting cr sat (Lhost) is found to be 
in good agreement with the expectation values, indicating 
that the interlopers and non-central hosts do not signifi- 
cantly distort the measurements. 

Finally, we applied our adaptive selection criteria to the 
2dFGRS. The resulting satellite kinematics are in excellent 
agreement with predictions based on the CLF formalism, 
once the various biases discussed above are taken into ac- 
count. We therefore conclude that the observed kinematics 
of satellite galaxies provide virtually independent, dynami- 
cal confirmation of the average mass-to-light ratios inferred 
by Yang et al. (2003a) and van den Bosch et al. (2003a) 
from a purely statistical method based on the abundances 
and clustering properties of galaxies in the 2dFGRS. 
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APPENDIX A: THE CONDITIONAL 
LUMINOSITY FUNCTION 

The construction of our mock galaxy redshift surveys uses 
the conditional luminosity function (CLF) to indicate how 
many galaxies of given luminosity occupy a halo of given 
mass. The CLF formalism was introduced by Yang et 
al. (2003a) and van den Bosch et al. (2003a), and we refer 
the reader to these papers for more details. Here we briefly 



summarize the main ingredients and we present the param- 
eterization used. 

The CLF is parameterized by a Schechter function: 



$(L|M)dL=|^ (i-)" exp(-L/L*)dL, 



(Al) 



where L* = L*(M), a = q(M) and $* = $*(M) are all 
functions of halo mass M. We write the average, total mass- 
to-light ratio of a halo of mass M as 



^=2 (t)o U) + U) 



(A2) 



This parameterization has four free parameters: a charac- 
teristic mass Mi, for which the mass-to- light ratio is equal 
to (M/L)o, and two slopes, 71 and 72, that specify the be- 
havior of {M/L) at the low and high mass ends, respec- 
tively. Motivated by observations (Bahcall, Lubin & Norman 
1995; Bahcall et al. 2000; Sanderson & Ponman 2003; Eke et 
al. 2004), which indicate a flattening of (M/L)(M) on the 
scale of galaxy clusters, we set (M/L)(M) — (M/L) c \ for 
haloes with M > 10 14 /i _1 M . With {M/L) c \ specified, the 
value for 72 derives from requiring continuity in (M/L) (M) 
across M = lO 14 ^ 1 M Q . 

A similar parameterization is used for the characteristic 
luminosity L*(M): 



M 



L*(M) 



3 (").««> 



/_M\~ 71 (MY 



with 



/(*) = 



r(a + 2) 
r(5 + i,i)' 



(A3) 



(A4) 



Here F(:r) is the Gamma function and F(o, x) the incomplete 
Gamma function. This parameterization has two additional 
free parameters: a characteristic mass M2 and a power-law 
slope 73. For a(M) we adopt a simple linear function of 
log(M), 



a(M) = ai 5 +?? log(Mis), 



(A5) 



with M15 the halo mass in units of 10 15 /i _1 Mq, CV15 = 
5(Mi5 = 1), and i) describes the change of the faint-end 
slope a with halo mass. Finally, we introduce the mass scale 
-Mmi n below which we set the CLF to zero; i.e., we assume 
that no stars form inside haloes with M < M m i n . Motivated 
by reionization considerations (see Yang et al. 2003a for de- 
tails) we adopt M m i n = 10 9 /i _1 M throughout. 

In this paper we use a CLF with the following pa- 
rameters: Mi = W 1112 ^ 1 M Q , M 2 = lO 1171 ^ 1 M Q , 
(M/L) = 85h (M/L) Q , 71 = 1.55, 72 = 0.46, 73 = 0.69, 
77 = —0.29 and a?i 5 = —0.99. This model is different from 
those listed in van den Bosch et al. (2003a) as it yields a 
higher, average mass-to-light ratio on the scale of clusters. 
As shown in Yang et al. (2003b), this is in better agree- 
ment with the observed pairwise peculiar velocity disper- 
sions of 2dFGRS galaxies (see also van den Bosch, Mo & 
Yang 2003b). However, none of the results presented in this 
paper are sensitive to our choice of the CLF. We verified 
that MGRSs based on either of the CLFs listed in van den 
Bosch et al. (2003a) yield virtually identical results to those 
presented here. 
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APPENDIX B: PROBABILITY DISTRIBUTION 
OF LUMINOSITY OF BRIGHTEST GALAXY IN 
A DARK MATTER HALO. 

Define L c as the luminosity of the brightest galaxy in a halo 
of mass M. It is convenient to write the conditional probabil- 
ity distribution P(L c \M)dL c in terms of the conditional lu- 
minosity function $(L c |M)dL c and a new function f(L c , M) 
which depends on how galaxy luminosities are drawn from 
the CLF: 

P(L c \M)dL c = $(L C |M) f{L c , M) dL c (Bl) 

In the case of 'constrained' drawing, L c has an expectation 
value given by eq. (5), and it is straightforward to show that 

/(L C ,M)_ j Q ifLc<Li(M) (B2) 

with Li(M) as defined by eq. (4). Note that in the MGRS 
used in this paper the luminosity of the brightest galaxy 
in each halo is always drawn constrained. Therefore, when 
computing expectation values for a sa t(Lhost) in the MGRS, 
we use P(L C \M) with f(L c ,M) given by eq. (B2). 

In the case of 'random' drawing, the situation is more 
complicated. The probability that a galaxy drawn at random 
from the CLF has a luminosity less than L c is given by 

P(< L C \M) = 1 - — L- f $(L|M)dL (B3) 
\ N ) M Jl c 

with {N)m the mean number of galaxies in a halo of mass 
M given by eq. (1). In a halo with TV galaxies, the prob- 
ability that the brightest galaxy has L < L c is simply 
[P(< L C \M)] N . Differentiating with respect to L c yields 
the probability Pjv(L c |TW")dL c that after TV drawings the 
brightest galaxy has a luminosity in the range L c ± L c /2. 
The full probability P(L c \M)dL c follows from summing 
Pjv(L c |M)dI/ c over TV, properly weighted by the probability 
P(N\M) that a halo of mass M contains TV galaxies. This 
yields 

oc 

/(L c , M) = — L- V TV P(N\M) [P« LclM)]"- 1 (B4) 

{n)m £i 

If (TV) M < 1 then 

(see Section 2.2), so that f(L c ,M) = 1. For (TV) M > 1 we 
have that 

P(iV|M) = {(N (N~-% 1 6XP(1 " {N)m) (B6) 
(see Section 2.2). Substituting (B6) in (B4) and using that 

°° k 

^l! (fc + 1) =(l+z)exp(z) (B7) 

fe=0 

(see eq. [1.212] in Gradshteyn & Ryzhik, 1980) yields 

c 

(TV), 



/(L c , M) = I 1 — -i— I exp(-C) (B8) 



with 

{__ 
(TV) 



. = Wm -i J°° Hm)AL (B9) 
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